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Abstract. The motivation of semantic wikis is to make acquisition, 
maintenance, and mining of formal knowledge simpler, faster, and more 
flexible. However, most existing semantic wikis have a very technical in- 
terface and are restricted to a relatively low level of expressivity. In this 
paper, we explain how AceWiki uses controlled English — concretely At- 
tempto Controlled English (ACE) — to provide a natural and intuitive 
interface while supporting a high degree of expressivity. We introduce re- 
cent improvements of the AceWiki system and user studies that indicate 
that AceWiki is usable and useful. 

1 Introduction 

We present an approach how semantic wikis and controlled natural language 
can be brought together. This section gives a short introduction into the fields 
of semantic wikis and controlled natural languages. 

1.1 Semantic Wikis 

Semantic wikis are a relatively new field of research that started in 2004 when 
semantic wikis were introduced in 1 16] describing the Platypus Wiki system. Dur- 
ing the last years, an active community emerged and many new semantic wiki 
systems were presented. Semantic wikis combine the philosophy of wikis (i.e. 
quick and easy editing of textual content in a collaborative way over the Web) 
with the concepts and techniques of the Semantic Web (i.e. enriching the data 
on the Web with well-defined meaning) . The idea is to manage formal knowledge 
representations within a wiki environment. 

Generally, two types of semantic wikis can be distinguished: On the one hand, 
there are text-centered approaches that enrich classical wiki environments with 
semantic annotations. On the other hand, logic-centered approaches use seman- 
tic wikis as a form of online ontology editors. Semantic MediaWiki [7], Ike Wiki 
[T2~] , Sweet Wiki |J, and HyperDEWiki [T^ are examples of text-centered se- 
mantic wikis, whereas OntoWiki pQ and myOntology [T5] are two examples of 
logic-centered semantic wikis. Web-Protege [17] is the Web version of the popular 



Protege ontology editor and can be seen as another example of a logic-centered 
semantic wiki, even though its developers do not call it a "semantic wiki" . In gen- 
eral, there are many new web applications that do not call themselves "semantic 
wikis" but exhibit many of their characteristic properties. Freebas^j] KnoodQ 
SWIRRlj^] and Twinc|^]are some examples. 

Semantic wikis seem to be a very promising approach to get the domain 
experts better involved in the creation and maintenance of ontologies. Semantic 
wikis could increase the number and quality of available ontologies which is 
an important step into the direction of making the Semantic Web a reality. 
However, we see two major problems with the existing semantic wikis. First, 
most of them have a very technical interface that is hard to understand and use 
for untrained persons, especially for those who have no particular background in 
formal knowledge representation. Second, existing semantic wikis support only 
a relatively low degree of expressivity — mostly just "subject predicate object"- 
structures — and do not allow the users to assert complex axioms. These two 
shortcomings have to be overcome to enable average domain experts to manage 
complex ontologies through semantic wiki interfaces. 

In this paper, we will argue for using controlled natural language within se- 
mantic wikis. The Wiki@ni[^] system follows a similar approach. It uses controlled 
natural language (concretely Rabbit and ACE) for verbalizing OWL axioms. In 
contrast to our approach, users cannot create or edit the controlled natural lan- 
guage sentences directly but only the underlying OWL axioms in a common 
formal notation. Furthermore, no reasoning takes place in Wiki@nt. 



1.2 Attempto Controlled English 

Controlled natural languages are subsets of natural languages that are controlled 
(in both syntax and semantics) in a way that removes or reduces the ambiguity of 
the language. Recently, several controlled natural languages have been proposed 
for the Semantic Web |14j . The idea is to represent formal statements in a way 
that looks like natural language in order to make them better understandable 
to people with no background in formal methods. 

Attempto Controlled English (ACE^is a controlled subset of English. While 
looking like natural English, it can be translated automatically and unambigu- 
ously into logic. Thus, every ACE text has a single and well-defined formal 
meaning. A subset of ACE has been used as a natural language front-end to 
OWL with a bidirectional mapping from ACE to OWL jS] . This mapping covers 
all of OWL 2 except data properties and some very complex class expressions. 

ACE supports a wide range of natural language constructs: nouns (e.g. 
"country"), proper names ("Zurich"), verbs ("contains"), adjectives ("rich"), singu- 
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lar and plural noun phrases ( "a person" , "some persons" ) , active and passive voice 
("owns", "is owned by"), pronouns ("she", "who", "something"), relative phrases 
("who is rich", "that is owned by John"), conjunction and disjunction ("and", "or"), 
existential and universal quantifiers ("a", "every"), negation ("no", "does not"), 
cardinality restrictions ("at most 3 persons"), anaphoric references ("the country", 
"she"), questions ("what borders Switzerland?"), and much more. Using these ele- 
ments, one can state ACE sentences like for example 

Every person who writes a book is an author. 

that can be translated into its logical representation: 

\/AVB(person(A) A write(A, B) A book(B) -> author (A)) 

In the functional-style syntax of OWL, the same statement would have to be 
expressed as follows: 

SubClassOf ( 

IntersectionOf ( 
Class ( : person) 
SomeValuesFrom( 

ObjectProperty ( : write) 
Class ( :book) 

) 

) 

Class ( : author) 

) 

This example shows the advantage of controlled natural languages like ACE over 
other logic languages. While the latter two statements require a considerable 
learning effort to be understood, the statement in ACE is very easy to grasp 
even for a completely untrained reader. We could show in an experiment that 
untrained users (who have no particular background in knowledge representation 
or computational linguistics) are able to understand ACE sentences very well and 
within very short time [10 . 

2 AceWiki 

We developed AceWiki that is a logic-centered semantic wiki that tries to solve 
the identified problems of existing semantic wikis by using a subset of ACE 
as its knowledge representation language. The goal of AceWiki is to show that 
semantic wikis can be more natural and at the same time more expressive than 
existing systems. Figure [l] shows a screenshot of the AceWiki interface. The 
general approach is to provide a simple and natural interface that hides all 
technical details. 

AceWiki has been introduced in [H] and Since then, many new fea- 
tures have been implemented: support for transitive adjectives, abbreviations for 
proper names, passive voice for transitive verbs, support for comments, client- 
side OWL export, a completely redesigned lexical editor, and proper persistent 



http://aHempto.ifi.uzln.ch/acewiki/geo/ 



Navigation: 

- Main Page 

- Index 

- Random Articfe 

- Search: 



Actions: 

- New Word.. 

- Export 



<Back\ Forward > \ Refresh \ 

Article | Noun | References | Individuals | Hierarchy |~ 

planet 



We use hens the definition of "planet" according to the International Astronomical Union 

(see http://www.iaij.orff ) without the restriction to soiar planets. 

Every planet is a celestial-body. 

No planet is a star. 

No planer. Is a dwarf-planet. 

The distinction between planet and dwarf-planet has been introduced by the 
International Astronomical Union In 2006. 
No planet is a moon. 
Every planet orbits a star. 

Every planet that does not orbit the Sun is an extrasolar planet. 
Which planets orbit the Sun? 

- Earth 

- Jupiter 

- Mars 

- Mercury 

- Neptune 

- Saturn 

- Uranus 

- Venus 



Every planet is a terrestrial planet or is a gas giant. 



Fig. 1. This screenshot of the AceWiki interface shows an article about planets. 
Articles in AceWiki consist of declarative ACE sentences and ACE questions 
(both in black color) and of unrestricted natural language comments (in gray 
color). 



storage of the wiki data. There is a public demo available^ and the source code 
of AceWiki can be downloaded under an open source license. However, AceWiki 
has not yet reached the stage where it could be used for real-world applications. 
Crucial (but scientifically not so interesting) parts are missing: history/undo 
facility, user management, and ontology import. 

One of the most interesting new features in AceWiki is the support for com- 
ments in unrestricted natural language, as it can be seen in Figure [l] Since it is 
unrealistic that all available information about a certain topic can be represented 
in a formal way, such comments can complement the formal ACE sentences. The 
comments can contain internal and external links, much like the text in tradi- 
tional non-semantic wikis. 

In order to enable the easy creation of ACE sentences, users are supported 
by an intelligent predictive text editor [IT] that is able to look ahead and to 
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Sentence Editor 



Every city that is located in a country that is 
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Fig. 2. This is the predictive editor of AceWiki. The partial sentence "every 
city that is located in a country that is ..." has already been entered and now the 
possible continuations are shown. In this way, the users can conveniently create 
syntactically correct sentences without learning the language in advance. 



show the possible words and phrases to continue the sentence. Figure [2] shows a 
screenshot of this editor. 

AceWiki supports an expressive subset of ACE. Some examples of sentences 
that can be created in AceWiki are shown here: 

O Every country that borders no sea is a landlocked-country. 

O Switzerland borders exactly 5 countries. 

O No city that is located in Europe is controlled by the USA. 

O If X borders Y then Y borders X. 

O Every moon orbits a planet or orbits a dwarf-planet. 

AceWiki is designed to seamlessly integrate a reasoner that can give feedback 
to the users, ensures the consistency of the ontology, can show its semantic 
structure, and answers queries. At the moment, we are using the OWL reasoner 
Pellet]^] and apply the ACE-to-OWL translator that is described in [5]. However, 
AceWiki is not restricted to OWL and another reasoner or rule engine might be 
used in the future. 

The subset of ACE that is used in AceWiki is more expressive than OWL, 
and thus the users can assert statements that have no OWL representation. 

8 http://clarkparsia.com/pellet/ 



Because we are using an OWL reasoner at the moment, such statements are not 
considered for reasoning. In order to make this clear to the users, the sentences 
that are outside of OWL are marked by a red triangle: 

O No ocean borders every continent. 

O Every person that has a car owns the car or leases the car. 

f> If Berlin is a capital then Germany is a stable country. 

[> Every trip that starts at X and that ends at X is a round trip. 

The most important task of the reasoner is to check consistency because 
only consistent ontologies enable to calculate logical entailments. In previous 
work [5], we explain how consistency is ensured in AceWiki by incrementally 
checking every new sentence that is added. 

Not only asserted but also inferred knowledge can be represented in ACE. At 
the moment, AceWiki shows inferred class hierarchies and class memberships. 
The hierarchy for the noun "country", for example, could look as follows: 

Upward 

O Every country is an area. 
O Every country is an object. 

Downward 

O Every alpine country is a country. 
O Every baltic-state is a country. 

Furthermore, ACE questions can be formulated within the articles. Such ques- 
tions are evaluated by the reasoner and the results are listed directly after the 
question: 

O Which cities are located in a country that borders Switzerland? 

- Berlin 

- Milano 

- Paris 

- Rome 

- Vienna 

& What is Switzerland? 

- an alpine country 

- an area 

- a country 

- a landlocked-country 

- an object 

If the question asks for a certain individual (represented in ACE by proper 
names) then the named classes (represented by nouns) of the individual are 
shown as the answer. In the cases where the question asks for a class (represented 
by a noun phrase), the individuals that belong to this class are shown as the 
answer. 

Thus, AceWiki uses ACE in different ways: as an expressive knowledge rep- 
resentation language for asserted knowledge, to display entailed knowledge gen- 
erated by the reasoner, and as a query language. 



Word Creator 



S 



Proper Name Noun Of-Construct Verb Transitive Adjective 




singular 
plural 



Every noun represents a certain type of things. For example, the noun "credit card" 
stands for all things that are credit cards. Every noun has a singular form and a plural 
form. 



|mounta!n 

examples; woman, credit card, process 



|mountains 

examples: women, credit cards, processes 



Fig. 3. The lexical editor of AceWiki helps the users to define the word forms. 
The example shows how a new noun "mountain" is created. 



In AceWiki, words have to be defined before they can be used. At the mo- 
ment, five types of words are supported: proper names, nouns, transitive verbs, 
o/-constructs (i.e. nouns that have to be used with o/-phrases), and transitive 
adjectives (i.e. adjectives that require an object). A new feature of AceWiki is 
that proper names can have an abbreviation that has exactly the same meaning 
as the long proper name. This is very helpful for proper names that arc too long 
to be spelled out each time. 

Figure [3] shows the lexical editor of AceWiki that helps the users in creating 
and modifying word forms in an appropriate way. An icon and an explanation 
in natural language help the users to choose the right category. 



3 Evaluation 

In order to find out how usable and how useful AceWiki is, we performed several 
tests. From time to time, we set up small usability experiments to test how well 
normal users are able to cope with the current version of AceWiki. Two such 
experiments have been performed so far. In order to find out whether AceWiki 
can be useful in the real world, we additionally conducted a small case study in 
which we tried to formalize the content of the existing Attempto project website 
in AceWiki. The results are explained in the following sections. Table [l] shows 
an overview. 



3.1 Usability Experiments 

Two usability experiments have been performed so far on AceWiki. The first one 
took place in November 2007 and has been described and analyzed in [5]. The 
second experiment — that is introduced here — was conducted one year later in 
November 2008. Both experiments have the nature of cheap ad hoc experiments 



Table 1. This table compares the settings of the three tests that have been 
performed on AceWiki. 





Experiment 1 


Experiment 2 


Case Study 


time 


Nov 2007 


Nov 2008 


Nov/Dec 2008 


AceWiki version 


0.2.1 


0.2.9 


0.2.10 


number of subjects (n) 


20 


6 


1 


subjects 


mostly students 


students 


AceWiki developer 


level of preexisting knowl- 


none 


low 


highest 


edge about AceWiki 








domain to be represented 


"the real world" 


universities 


Attempto project 



with the goal to get some feedback about possible weak points of AceWiki. 
Since the settings of the two experiments were different and since the number 
of subjects was relatively low, we cannot draw strong statistical conclusions. 
Nevertheless, these experiments can give us valuable feedback about the usability 
of AceWiki. 

In both experiments, the subjects were told to create a formal knowledge 
base in a collaborative way using AceWiki. The task was just to add correct and 
meaningful knowledge about the given domain without any further constraints 
on the kind of knowledge to be added. The subjects — mostly students — had 
no particular background in formal knowledge representation. The domain to 
be represented was the real world in general in the first experiment, and the 
domain of universities (i.e. students, departments, professors, etc.) in the second 
experiment. 

In the first experiment, the subjects received no instructions at all how 
AceWiki has to be used. In the second experiment, they attended a 45 minutes 
lesson about AceWiki. Another important difference is that the first experiment 
used an older version of AceWiki where templates could be used for the creation 
of certain types of sentences (e.g. class hierarchies). This has been removed in 
later versions because of its lack of generality. 

Table [2] shows the results of the two experiments. Since the subjects worked 
together on the same knowledge base and could change or remove the contri- 
butions of others, we can look at the results from two perspectives: On the one 
hand, there is the community perspective where we consider only the final result, 
not counting the sentences that have been removed at some point and only look- 
ing at the final versions of the sentences. On the other hand, from the individuals 
perspective we count also the sentences that have been changed or removed by 
another subject. The different versions of a changed sentence count for each of 
the respective subjects. However, sentences created and then removed by the 
same subject are not counted, and only the last version counts for sentences 
that have been changed by the same subject. 

The first part of the table shows the number and type of sentences the sub- 
jects created. In total, the resulting knowledge bases contained 179 and 93 sen- 



Table 2. This table shows the results of the first (Exp. 1) and the second (Exp. 
2) experiment. The results can be seen from the individuals perspective (ind.) 
or the community perspective (comm.) 
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Exp. 2 
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148 


145 


76 


73 
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Q1 






U -L 


sentences using "a" instead of "every" 


s e 


9 


9 


23 


12 


sentences using misclassified words 


gw 


9 


8 





() 


other incorrect sentences 


s~ 


20 


17 


14 


g 


7r\ C\\ PO.WPPt" QPTlfPTlPPQ 


S+ /g 


OVJ /(J 


81% 


67% 


78% 


% of (almost) correct sentences 


(S + + S e )/S 


84% 


86% 


88% 


91% 


% of complex sentences 


S+/S+ 


61% 


61% 


71% 


70% 


total words created 


w 


170 


170 


53 


50 


individuals (i.e. proper names) 


w p 


44 


44 


11 


10 


classes (i.e. nouns) 


Wn 


81 


81 


14 


14 


relations total 


Wr 


45 


45 


28 


26 


transitive verbs 


w v 


39 


39 


20 


18 


o/-constructs 


W 


6 


6 


2 


2 


transitive adjectives 


w a 






6 


6 


sentences per word 


S/w 


1.09 


1.05 


2.13 


1.86 


correct sentences per word 


S + /w 


0.87 


0.85 


1.43 


1.46 


total time spent (in minutes) 


t 


930.9 


930.9 


360.2 


360.2 


av. time per subject 


t/n 


46.5 


46.5 


60.0 


60.0 


av. time per correct sentence 


t/S+ 


6.3 


6.4 


4.7 


4.9 


av. time per (almost) correct sentence 


t/(S+ + S e ) 


5.9 


6.0 


3.6 


4.2 



tences, respectively. We checked these sentences manually for correctness. S + 
stands for the number of sentences that are (1) logically correct and (2) sensible 
to state. 

The first criterion is simple: In order to be classified as correct, the sentence 
has to represent a correct statement about the real world using the common 
interpretations of the words and applying the interpretation rules of ACE. 

The second criterion can be explained best on the basis of the sentences of 
the type S e . These sentence start with "a ..." like for example "a student studies 
at a university". This sentence is interpreted in ACE as having only existential 
quantification: "there is a student that studies at a university". This is certainly a 
logically correct statement about the real world, but the writer probably wanted 
to say "every student studies at a university" which is a more precise and more 
sensible statement. For this reason, such statements are not considered correct, 
even though they are correct from a purely logical point of view. 

Sentences of the type S e have been identified in [5] as one of two frequent error 
types when using AceWiki. The other one — denoted by S w — are sentences 




A country is an area. 



Your sentence "a country is interpreted as "there is a 
country that ... ". Do you want to say "every country ... "? 



Fig. 4. This figure shows a solution to the problem that the users often state 
sentences starting with "a ..." when they should say "every This is included 
in the latest version of AceWiki. 



using words in the wrong word category like for example "every London is a city" 
where "London" has been added as a noun instead of a proper name. 

It is interesting that the incorrect sentences of the types S e and S w had the 
same frequency in the first experiment, but evolved in different directions in the 
second experiment. There was not a single case of S" i '-mistakes in the second 
experiment. This might be due to the fact that we learned from the results of 
the first experiment and enriched the lexical editor with icons and explanations 
(see Figure |3| . 

On the other hand, the number of ^-mistakes increased. This might be 
caused by the removal of the templates feature from AceWiki. In the first ex- 
periment, the subjects were encouraged to say "every ..." because there were 
templates for such sentences. In the second experiment, those templates were 
not available anymore and the subjects were tempted to say "a" instead of 
"every" . This is bad news for AceWiki, but the good news is that there are two 
indications that we are on the right track nevertheless. First, while none of the 
5 e -sentences has been corrected in the first experiment, almost half of them have 
been removed or changed by the community during the second experiment. This 
indicates that some subjects of the second experiment recognized the problem 
and tried to resolve it. Second, the S^-sentences can be detected and resolved in 
a very easy way. Almost every sentence starting with "a ..." is an »S e -sentence 
and can be corrected just by replacing the initial "a" by "every" . After the sec- 
ond experiment, we added a new feature to AceWiki that asks the users each 
time they create a sentence of the form "a ..." whether it should be "every ...". 
The users can then say whether they really mean "a ..." or whether it should 
be rather "every In the latter case the sentence is automatically corrected. 
Figure [4] shows a screenshot of the dialog shown to the users. Future experiments 
will show whether this solves the problem. 

An interesting figure is of course the ratio of correct sentences S + / S. As it 
turns out, the first experiment exhibits the better ratio for both perspectives: 
80% versus 67% for the individuals and 81% versus 78% for the community. 
However, since 5 e -sentences are easily detectable and correctable (and hopefully 
a solved problem with the latest version of AceWiki), it makes sense to have a 



look at the ratio of "(almost) correct" sentences consisting of the correct sen- 
tences S+ and the ^-sentences. This ratio was better in the second experiment: 
84% versus 88% for the individuals perspective; 86% versus 91% for the com- 
munity perspective. However, the settings of the experiments do not allow us to 
draw any statistical conclusions from these numbers. Nevertheless, these results 
give us the impression that a ratio of correct and sensible statements of 90% and 
above is achievable with our approach. 

Another important aspect is the complexity of the created sentences. Of 
course, syntactically and semantically complex statements are harder to con- 
struct than simple ones. For this reason, we classified the correct sentences ac- 
cording to their complexity. S£ stands for all correct sentences that are complex 
in the sense that they contain a negation ("no", "does not", etc.), an implication 
( "every" , "if . . . then" , etc.) , a disjunction ( "or" ) , a cardinality restriction ( "at most 
3", etc.), or several of these elements. While the ratio of complex sentences was 
already very high in the first experiment (around 60%), it was was even higher 
in the second experiment reaching 70%. Looking at the concrete sentences the 
subjects created during the second experiment, one can see that they managed 
to create a broad variety of complex sentences. Some examples are shown here: 

— Every lecture is attended by at least 3 students. 

— Every lecturer is a professor or is an assistant. 

— Every professor is employed by a university. 

— If X contains Y then X is larger_than Y. 

— If somebody X likes Y then X does not hate Y. 

— If X is a student and a professor knows X then the professor hates X or likes X or is 
indifferent_to X. 

The last example is even too complex to be represented in OWL. Thus, the 
AceWiki user interface seems to scale very well in respect to the complexity of 
the ontology. 

The second part of Table [2] shows the number and types of the words that 
have been created during the experiment. All types of words have been used by 
the subjects with the exception that transitive adjectives were not supported by 
the AceWiki version used for the first experiment. It is interesting to see that the 
first experiment resulted in an ontology consisting of more words than correct 
sentences, whereas in the second experiment the number of correct sentences 
clearly exceeds the number of words. This is an indication that the terms in the 
second experiment have been reused more often and were more interconnected. 

The third part of Table |2]takes the time dimension into account. On average, 
each subject of the first experiment spent 47 minutes, and each subject of the 
second experiment spent 60 minutes. The average time per correct sentence 
that was around 6.4 minutes in the first experiment was much better in the 
second experiment being only 4.9 minutes. We consider these time values very 
good results, given that the subjects were not trained and had no particular 
background in formal knowledge representation. 

In general, the results of the two experiments indicate that AceWiki enables 
unexperienced users to create complex ontologies within a short amount of time. 



Table 3. This table lists the results of the AceWiki case study where the content 
of the Attempto website was formalized in AceWiki. 



total sentences created 


5" 


538 


complex sentences 


S x 


107 


% of complex sentences 


SJS 


19.9% 


total words created 


w 


261 


individuals (i.e. proper names) 


w p 


184 


classes (i.e. nouns) 


w n 


46 


relations total 


w r 


31 


transitive verbs 


w v 


11 


o/-constructs 


Wo 


13 


transitive adjectives 


W a 


7 


sentences per word 


S/w 


2.061 


time spent (in minutes) 


t 


347.8 (= 5.8 h) 


av. time per sentence 


t/S 


0.65 (= 38.8 s) 



3.2 Case Study 

The two experiments presented above seem to confirm that AceWiki can be used 
easily by untrained persons. However, usability does not imply the usefulness 
for a particular purpose. For that reason, we performed a small case study to 
exemplify how an experienced user can represent a strictly defined part of real- 
world knowledge in AceWiki in a useful way. 

The case study presented here consists of the formalization of the content of 
the Attempto website]^] in AceWiki. This website contains information about the 
Attempto project and its members, collaborators, documents, tools, languages, 
and publications, and the relations among these entities. Thus, the information 
provided by the Attempto website is a piece of relevant real-world knowledge. 

In the case study to be presented, one person — the author of this paper 
who is the developer of AceWiki — used a plain AceWiki instance and filled 
it with the information found on the public Attempto website. The goal was 
to represent as much as possible of the original information in a natural and 
adequate way. This was done manually using the AceWiki editor without any 
kind of automation. 

Table [3] shows the results of the case study. The formalization of the website 
content took less than six hours and resulted in 538 sentences. This gives an 
average time per sentence of less than 40 seconds. These results give us some 
indication that AceWiki is not only usable for novice users but can also be used 
in an efficient way by experienced users. 

Most of the created words are proper names (i.e. individuals) which is not 
surprising for the formalization of a project website. The ratio of complex sen- 
tences is much lower than the ones encountered in the experiments but with 
almost 20% still on a considerable level. 
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Attempto Project 



Attempts is a research project of the University of Zurich with the objective to develop Attempto Controlled English 
(ACE) and its tools. The project Attempto is jointly supported by the Department of Informatics and the Institute of 
Computational Linguistics , 

Attempto project 



O The Attempto project is a research project that is a part of the University of Zurich. 

O The Attempto project is affiliated with the Department of Informatics and is affiliated with 

the Institute of Computational Linguistics, 
O The Attempto project is dedicated to Attempto Controlled English that is a controlled natural language. 

Fig. 5. This figure shows an example text that occurs on the Attempto website 
(top) and how it was represented in AceWiki (bottom). 



Basically, all relevant content of the Attempto website could be represented 
in AceWiki. Of course, the text could not be taken over verbatim but had to 
be rephrased. Figure [5] exemplary shows how the content of the website was 
formalized. The resulting ACE sentences are natural and understandable. 

However, some minor problems were encountered. Data types like strings, 
numbers, and dates would have been helpful but are not supported. ACE itself 
has support for strings and numbers, but AceWiki does not use this feature so 
far. Another problem was that the words in AceWiki can consist only of letters, 
numbers, hyphens, and blank spacej*°) Some things like publication titles or 
package names contain colons or dots which had to be replaced by hyphens in 
the AceWiki representation. We plan to solve these problems by adding support 
for data types and being more flexible in respect to user-defined words. 

Figure [6] shows a wiki article that resulted from the case study. It shows how 
inline queries can be used for automatically generated and updated content. This 
is an important advantage of such semantic wiki systems. The knowledge has to 
be asserted once but can be displayed at different places. In the case of AceWiki, 
such automatically created content is well separated from asserted content in a 
natural and simple manner by using ACE questions. 

As can be seen on Figure [6] the abbreviation feature for proper names has 
been used extensively. The answer lists show the abbreviations in parentheses 
after the long proper names. The most natural name for a publication, for ex- 
ample, is its title. However, sentences that contain a spelled-out publication title 
become very hard to read. In such cases, abbreviations are defined which can be 
used conveniently to refer to the publication. 

The fact that the AceWiki developer is able to use AceWiki in an efficient 
way for representing real world knowledge does of course not imply that every 



Blank spaces are represented internally as underscores. 



Tobias Kuhn 



l> Tobias Kuhn is a person 

l> Tobias Kuhn is a member of tbe Attempto group. 
I> Which tools are developed by Tobias Kuhn^ 

- AceRules 

- AceWiki 

- Attempto Parsing Engine (APE] 

l> Which languages are developed by Tobias Kuhn? 
-Attempto Controlled English (ACE) 

t> Tobias Kuhn is an author of which publications? 

- AceWiki - A Natural and Expressive Semantic Wiki (SWUIOBKuhn) 

- Attempto Controlled English for Knowledge Representation (RWOBFuchs) 

- Collaborative Ontology Management in Controlled Natural Language (SemWikiOSKuhn) 

- Combining Semantic Wikis and Controlled Natural Language (ISWCOBKuhn) 

- Discourse Representation Structures for ACE 6-0 [DRS Report) 

- Executing Rules in Controlled Natural Language (RR07Kuhn) 

- Expressing Knowledge about Protein Interactions in Attempto Controlled English (DiplTh-Kuhn) 

- Improving Text Mining with Controlled Natural Language - A Case Study for Protein Interactions (DILS06Kuhn) 

- Writing Support for Controlled Natural Languages (ALTA08Kuhn) 

Fig. 6. This figure shows an exemplary wiki article that resulted from the case 
study. Inline queries are used to generate content that is automatically updated. 



experienced user is able to do so. However, we can see the results as an upper 
boundary of what is possible to achieve with AceWiki, and the results show that 
AceWiki in principle can be used in an effective way. 



4 Conclusions 



We presented the AceWiki system that should solve the problems that existing 
semantic wikis do not support expressive ontology languages and are hard to 
understand for untrained persons. AceWiki shows how semantic wikis can serve 
as online ontology editors for domain experts with no background in formal 
methods. 

The two user experiments indicate that unexperienced users are able to deal 
with AceWiki. The subjects managed to create many correct and complex state- 
ments within a short period of time. The presented case study indicates that 
AceWiki is suitable for formalization tasks of the real world and that it can 
be used — in principle — by experienced users in an efficient way. Still, more 
user studies are needed in the future to prove our claim that controlled natural 
language improves the usability of semantic wikis. 

In general, we showed how controlled natural language can bring the Semantic 
Web closer to the end users. The full power of the Semantic Web can only be 
exploited if a large part of the Web users are able to understand and extend the 
semantic data. 
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